Natural statistics for spectral samples 



E. Di Nardo, P. McCullagh, D. Senato



Abstract 

Spectral sampling is associated with the group of unitary transformations acting
on matrices in much the same way that simple random sampling is associated with
the symmetric group acting on vectors. This parallel extends to symmetric functions,
k-statistics and polykays. We construct spectral k-statistics as unbiased estimators
of cumulants of trace powers of a suitable random matrix. Moreover we define
normalized spectral polykays in such a way that when the sampling is from an infinite
population they return products of free cumulants.

keywords: random matrix, cumulant of traces, free cumulant, polykay 

1 Outline 

The goals of this paper are threefold. 

We first introduce the notion of spectral sampling as an operation on a finite set of $n$
real numbers $x = (x_1, \ldots, x_n)$ generating a random set $y = (y_1, \ldots, y_m)$ of $m \le n$ real
numbers whose distribution is determined by $x$. Spectral sampling is not the same as
simple random sampling in the sense that $y$ is not a subset of $x$, but the parallels are
unmistakable and striking. In particular, there exist symmetric functions $\mathfrak{K}_\lambda$, analogous
to k-statistics and polykays, such that $E(\mathfrak{K}_\lambda(y) \mid x) = \mathfrak{K}_\lambda(x)$. In other words, the average
value of $\mathfrak{K}_\lambda(\cdot)$ for spectral samples $y$ taken from $x$ is equal to $\mathfrak{K}_\lambda(x)$. The first goal is
to obtain explicit expressions for these spectral k-statistics, which is done in sections 3-5
using symbolic umbral techniques.

The second goal is to elucidate some of the concepts associated with freeness (free
probability and free cumulants) in terms of spectral sampling and spectral k-statistics.
For this purpose, spectral sampling may be viewed as a restriction operation $X \mapsto Y$
from a freely randomized Hermitian matrix of order $n$ into a freely randomized Hermitian
matrix of order $m \le n$, and each spectral k-statistic is a class function depending only on the
matrix eigenvalues. In essence, the spectral k-statistics tell us which spectral properties
are preserved on average by freely randomized matrix restriction. For example, $\mathfrak{K}_{(1)}(x) = \bar{x}$
tells us that the eigenvalue average is preserved. Likewise, if $k_2$ denotes the usual sample
variance with divisor $n - 1$, the second spectral statistic $\mathfrak{K}_{(2)}(x) = k_2(x)/(n + 1)$ tells us
that the eigenvalue sample variance is not preserved, but is, on average, proportional to
the sample size plus one.

Finally, by considering the limit as $n \to \infty$, we show that the normalized spectral
k-statistics are related to free cumulants in much the same way that polykays are related to
ordinary cumulants.

2 Spectral sampling 
2.1 Definition 

A random Hermitian matrix $A$ of order $n$ is said to be freely randomized if its distribution
is invariant under unitary conjugation, i.e. $A \sim GAG^\dagger$ for each unitary $G$. In particular, if
$H$ is uniformly distributed with respect to Haar measure on the group of unitary matrices
of order $n$, $HAH^\dagger$ is freely randomized. If $A$ is freely randomized, each leading sub-matrix
is also freely randomized.

Let $x = (x_1, \ldots, x_n)$ be given real numbers, let $X = \mathrm{diag}(x)$ be the associated diagonal
matrix, and let $HXH^\dagger$ be the freely randomized matrix. The sample matrix $Y$ is the
leading $m \times m$ sub-matrix in the freely randomized matrix, i.e. $Y = (HXH^\dagger)_{[m \times m]}$.

Definition 2.1 (Spectral sample). The set of eigenvalues $y = (y_1, \ldots, y_m) \in \mathbb{R}^m$ of the
$m \times m$ Hermitian random matrix $Y = (HXH^\dagger)_{[m \times m]}$ is called a spectral sample of size $m$
from $x$.

For $m = n$, the distribution is uniform with the same weight $1/n!$ on all permutations
$\sigma \in \mathfrak{S}_n$, i.e. $y$ is a random permutation of $x$. For $m < n$, however, the distribution in $\mathbb{R}^m$
is non-atomic, so the sample values $y$ do not ordinarily occur among the components of $x$.
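
As an illustration of Definition 2.1 in the simplest case $m = 1$, the following Python sketch draws spectral samples of size one. It rests on an assumption that is not stated in the text: the first row of a Haar-distributed unitary matrix is a uniformly distributed unit vector of $\mathbb{C}^n$, which can be generated by normalizing iid complex Gaussian variables. The function name `spectral_sample_size_one` and the numerical values are hypothetical.

```python
import random

def spectral_sample_size_one(x, rng):
    """Draw y = (H X H^dagger)_{11} = sum_j |h_1j|^2 x_j for a Haar unitary H.

    Assumption (not from the text): the first row of H is a uniform unit
    vector of C^n, obtained here by normalizing iid complex Gaussians.
    """
    g = [complex(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in x]
    norm2 = sum(abs(z) ** 2 for z in g)
    weights = [abs(z) ** 2 / norm2 for z in g]  # nonnegative, summing to 1
    return sum(w * xi for w, xi in zip(weights, x))

rng = random.Random(0)  # fixed seed so that the run is reproducible
x = [1.0, 2.0, 4.0, 7.0]
draws = [spectral_sample_size_one(x, rng) for _ in range(2000)]
sample_mean = sum(draws) / len(draws)
```

Each draw is a random convex combination of the components of $x$, so it lies between the smallest and largest component without generally coinciding with any of them. The Monte Carlo average `sample_mean` should be close to the mean of $x$.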

If the group of unitary transformations in the preceding definition were replaced by
a sub-group, the sampling distribution would be altered accordingly. The most obvious
subgroups are the group of orthogonal transformations and the group of permutations $[n] \to [n]$;
in each case there is an associated family of spectral functions such that $E(\mathfrak{K}_\lambda(y) \mid x) =
\mathfrak{K}_\lambda(x)$. In particular, if $H$ is a uniform random permutation, $y$ is a simple random sample
of size $m$ taken from $x$, and the associated spectral functions are the classical k-statistics
due to Fisher (1929) [9] and the polykays due to Tukey (1950) [21].

Remark 2.2. Within image compression [TJ], the random Hermitian matrix $Y$ in Definition
2.1 is called a two-dimensional Haar transform. More generally, if $X$ is a full matrix whose
entries are the pixels ranging from 0 (black) to 255 (white), then $Y$ contains reduced
information extracted from $X$ via the rectangular Haar matrix $H$. Similar transformations
are employed also within classification, document analysis and hardware implementation, and
are known as downsampling of a vector or a matrix [19].

2.2 Natural statistics



For present purposes, a statistic $T$ is a collection of functions $T_n : \mathbb{R}^n \to \mathbb{R}$ such that $T_m(y)$
and $T_n(x)$ are defined for all samples sufficiently large. For example, the usual sample
variance is defined for $n \ge 2$, while the sample skewness is defined for $n \ge 3$.

Definition 2.3 (Natural statistic). A statistic $T$ is said to be natural if, for each $m \le n$,
the average value of $T_m(\cdot)$ over random samples $y$ drawn from $x$ is equal to $T_n(x)$. In
symbols,

$$E(T_m(y) \mid x) = T_n(x)$$

for each $m \le n$.

Obviously, the definition depends on what it means for $y$ to be a random sample drawn
from $x$, i.e. the choice of group in Definition 2.1. Thus, a statistic that is natural with
respect to simple random sampling (a U-statistic) is not, in general, natural with respect
to spectral sampling.
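
Definition 2.3 can be checked exactly for simple random sampling on a small population. The sketch below is an illustration only; the helper names and the population values are hypothetical. It enumerates all subsets of size $m$ and confirms that the sample variance with divisor $n - 1$ is natural, whereas the variance with divisor $n$ is not.

```python
from fractions import Fraction
from itertools import combinations

def mean(v):
    return Fraction(sum(v), len(v))

def var_unbiased(v):
    """Sample variance with divisor len(v) - 1."""
    mu = mean(v)
    return sum((t - mu) ** 2 for t in v) / (len(v) - 1)

def var_biased(v):
    """Variance with divisor len(v)."""
    mu = mean(v)
    return sum((t - mu) ** 2 for t in v) / len(v)

def average_over_samples(stat, x, m):
    """Exact average of stat over all simple random samples of size m."""
    samples = list(combinations(x, m))
    return sum(stat(s) for s in samples) / len(samples)

x = [Fraction(v) for v in (1, 2, 4, 7, 11)]
natural = all(
    average_over_samples(var_unbiased, x, m) == var_unbiased(x)
    for m in range(2, len(x) + 1)
)
not_natural = average_over_samples(var_biased, x, 3) != var_biased(x)
```

Exact rational arithmetic is used so that the equality in Definition 2.3 is tested without rounding error.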

In Tukey [21], such functions were said to be "inherited on the average". The key
point in Definition 2.3 is that a natural statistic is not a single function in isolation, but
a list of functions $\{T_n : \mathbb{R}^n \to \mathbb{R}\}$. It is the property of inheritance that gives these
functions a common interpretation independent of the sample size. One might be inclined to
think that inheritance is no different from unbiasedness relative to a model with
exchangeably distributed components. However, unbiasedness of $T_n$ does not imply the inheritance
property, nor does inheritance imply that the statistic has a limit or that its expectation
exists. Unbiasedness in parametric models is a property of individual functions $T_n$, whereas
inheritance is a property of the sequence.

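For $m = n$, inheritance implies that each $T_n$ is a symmetric function: $T_n(x)$ is equal to
the average of the values on the permutations of $x$. Tukey [21] proved that the symmetric
functions

$$\tilde a_{r,n}(x) = \frac{1}{(n)_1} \sum_{i} x_i^r, \quad \tilde a_{rs,n}(x) = \frac{1}{(n)_2} \sum_{i \ne j} x_i^r x_j^s, \quad \tilde a_{rst,n}(x) = \frac{1}{(n)_3} \sum_{i \ne j \ne k} x_i^r x_j^s x_k^t, \ \ldots$$   (2.1)

defined respectively for $n \ge 1$, $n \ge 2$ and so on, are natural with respect to simple random
sampling. Here and elsewhere $(n)_r = n(n - 1) \cdots (n - r + 1)$ denotes the descending factorial
function, and the sums run over distinct indices. Ordinarily, we suppress the index $n$ and write
$\tilde a_{rs}(x)$ instead of $\tilde a_{rs,n}(x)$, the value of $n$ being inferred from the argument $x \in \mathbb{R}^n$.
The unnormalized polynomials

$$a_r(x) = \sum_{i} x_i^r, \quad a_{rs}(x) = \sum_{i \ne j} x_i^r x_j^s, \quad a_{rst}(x) = \sum_{i \ne j \ne k} x_i^r x_j^s x_k^t, \ \ldots$$   (2.2)

are the well-known augmented symmetric functions [18].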

Every expression which is a polynomial, symmetric and inherited on the average can
be written as a linear combination of the statistics in (2.1) with coefficients that do not
depend on the size of the set [21]. Consequently each linear combination, with scalar
coefficients independent of $n$, also has the inheritance property, as happens for example for
U-statistics. The combinations that have proved to be most useful for statistical purposes
are the k-statistics due to Fisher [9] and the polykays due to Tukey [21, 22], defined as
follows:



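$$k_{(1^2)} = \tilde a_{(1^2)}, \quad k_{(2)} = \tilde a_{(2)} - \tilde a_{(1^2)}$$   (2.3)

$$k_{(1^3)} = \tilde a_{(1^3)}, \quad k_{(12)} = \tilde a_{(12)} - \tilde a_{(1^3)}, \quad k_{(3)} = \tilde a_{(3)} - 3\tilde a_{(12)} + 2\tilde a_{(1^3)}$$   (2.4)

$$k_{(1^4)} = \tilde a_{(1^4)}, \quad k_{(1^2 2)} = \tilde a_{(1^2 2)} - \tilde a_{(1^4)}, \quad k_{(13)} = \tilde a_{(13)} - 3\tilde a_{(1^2 2)} + 2\tilde a_{(1^4)}$$   (2.5)

$$k_{(2^2)} = \tilde a_{(2^2)} - 2\tilde a_{(1^2 2)} + \tilde a_{(1^4)}, \quad k_{(4)} = \tilde a_{(4)} - 4\tilde a_{(13)} - 3\tilde a_{(2^2)} + 12\tilde a_{(1^2 2)} - 6\tilde a_{(1^4)}.$$   (2.6)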

The single-index $k$s are the k-statistics; the multi-index $k$s are the polykays. For a sample of
iid variables, each k-statistic is an unbiased estimator of the population cumulant, and each
polykay is an unbiased estimator of cumulant products. The degree of each $k$ is the sum
of the subscripts. The set of natural polynomial statistics of degree $i$ is a vector space, of
dimension equal to the number of partitions of the integer $i$, spanned by the $k$s of degree $i$.
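
The k-statistics and polykays of low degree can be evaluated directly from the normalized augmented symmetric functions. The following sketch is a brute-force illustration (the function names are hypothetical, and the enumeration is only practical for small $n$). It builds $\tilde a_\lambda$ as an average over ordered tuples of distinct components and checks inheritance on the average under simple random sampling by exact enumeration.

```python
from fractions import Fraction
from itertools import combinations, permutations

def a_tilde(powers, x):
    """Normalized augmented symmetric function: the average of
    x_i^r x_j^s ... over all ordered tuples of distinct indices.
    Requires len(x) >= len(powers)."""
    tuples = list(permutations(x, len(powers)))  # (n)_r ordered tuples
    total = Fraction(0)
    for tup in tuples:
        term = Fraction(1)
        for value, p in zip(tup, powers):
            term *= value ** p
        total += term
    return total / len(tuples)

def k2(x):
    return a_tilde((2,), x) - a_tilde((1, 1), x)

def k12(x):
    return a_tilde((1, 2), x) - a_tilde((1, 1, 1), x)

def k3(x):
    return a_tilde((3,), x) - 3 * a_tilde((1, 2), x) + 2 * a_tilde((1, 1, 1), x)

def k22(x):
    return a_tilde((2, 2), x) - 2 * a_tilde((1, 1, 2), x) + a_tilde((1, 1, 1, 1), x)

def srs_average(stat, x, m):
    """Exact average of stat over all simple random samples of size m."""
    samples = list(combinations(x, m))
    return sum(stat(s) for s in samples) / len(samples)

x = [Fraction(v) for v in (1, 2, 4, 7, 11, 16)]
inherited = all(srs_average(k, x, 4) == k(x) for k in (k2, k12, k3, k22))
```

Here `k2` coincides with the usual sample variance, and `inherited` records whether the four statistics computed on samples of size 4 average back to their population values.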

3 Moment symbolic method 

Univariate case. The moment symbolic method relies on the classical umbral calculus
introduced by Rota and Taylor in 1994 [16], which has been developed and refined in a series
of papers starting from [7, 8]. The result is a calculus in which certain symbols represent
scalar or polynomial sequences, thereby reducing the overall computational apparatus. We
now review the key components.

Let $R$ be the real or complex field whose elements are called scalars. An umbral calculus
consists of a generating set $A = \{\alpha, \beta, \ldots\}$, called the alphabet, whose elements are named
umbrae, a polynomial ring $R[A]$ and a linear functional $E : R[A] \to R$ called evaluation.
The linear functional is such that $E[1] = 1$ and

$$E[\alpha^i \beta^j \cdots \gamma^k] = E[\alpha^i]\, E[\beta^j] \cdots E[\gamma^k]$$   (uncorrelation property)

for any set of distinct umbrae in $A$ and for $i, j, k$ nonnegative integers. To each umbra
$\alpha \in A$ there corresponds a sequence of scalars $a_i = E[\alpha^i]$ for $i = 0, 1, \ldots$ such that $a_0 = 1$.
The scalar $a_i$ is called the $i$th moment of $\alpha$. Indeed any scalar random variable possessing
finite moments can be represented by an umbra. A scalar sequence $\{a_i\}$ with $a_0 = 1$ is said
to be represented by an umbra $\alpha$ if $E[\alpha^i] = a_i$ for $i = 0, 1, \ldots$.

Example 3.1. The sequence $1, 0, 0, \ldots$ is umbrally represented by the augmentation umbra
$\varepsilon$, and $1, 1, 1, \ldots$ is umbrally represented by the unity umbra $u$. These are the umbral
versions of two degenerate random variables such that $P(X = 0) = 1$ and $P(Y = 1) = 1$.
The sequence of moments of a unit Poisson random variable is umbrally represented by the
Bell umbra $\beta$. This umbra plays a fundamental role in the symbolic method, as we will
see later. Its $i$th moment is the Bell number, which is the coefficient of $z^i/i!$ in the Taylor
expansion of $\exp(e^z - 1)$. □

Since an umbra is a formal object, questions involving the moment problem are not
taken into account. Indeed, not every umbra corresponds to a real-valued random variable.

Example 3.2. The sequence $1, 1, 0, 0, \ldots$ is represented by the singleton umbra $\chi$. Its
variance $E[\chi^2] - E[\chi]^2 = -1$ is negative, so there is no real-valued random variable
corresponding to $\chi$. Nevertheless this umbra plays a fundamental role in dealing with cumulant
sequences, as we will see later. □

It is always possible to make the alphabet large enough so that, to each scalar sequence
$\{a_i\}$, there corresponds an umbra $\alpha \in A$, which is not necessarily unique. The same
applies to identically distributed random variables. Two umbrae $\alpha$ and $\gamma$ having the same
moment sequence are called similar, in symbols $\alpha \equiv \gamma$, and $A$ contains an unlimited supply
of distinct umbrae similar to $\alpha$, usually denoted by $\alpha', \alpha'', \ldots$. If the sequence $\{a_i\}$ is
umbrally represented by $\alpha$, then

the sequence $\{2^i a_i\}$ is represented by $\alpha + \alpha = 2\alpha$,
the sequence $\{\sum_{j=0}^{i} \binom{i}{j} a_j a_{i-j}\}$ is represented by $\alpha + \alpha'$.

An expression such as $2\alpha$ or $\alpha + \alpha'$ is an example of an umbral polynomial, that is a
polynomial $p \in R[A]$ in the umbrae of $A$. The support of an umbral polynomial is the set
of all umbrae that occur in it. So the support of $\alpha + \alpha'$ is $\{\alpha, \alpha'\}$, and the support of $2\alpha$
is $\{\alpha\}$.

The formal power series

$$e^{\alpha z} = u + \sum_{i \ge 1} \alpha^i \frac{z^i}{i!} \in R[A][[z]]$$   (3.2)

is the generating function of the umbra $\alpha$. Moreover, each exponential formal power series

$$f(z) = 1 + \sum_{i \ge 1} a_i \frac{z^i}{i!} \in R[[z]]$$   (3.3)

can be umbrally represented by a formal power series (3.2) in $R[A][[z]]$ [20]. In fact, if
the sequence $1, a_1, a_2, \ldots$ is umbrally represented by $\alpha$, the action of evaluation $E$ can be
extended coefficient-wise to formal power series (3.2), so that $E[e^{\alpha z}] = f(z)$. For clarity
we denote the generating function of $\alpha$ by $f(\alpha, z) = E[e^{\alpha z}]$. Therefore $\alpha \equiv \alpha'$ iff $f(\alpha, z) =
f(\alpha', z)$.

The first advantage of umbral notation is the representation of operations on generating
functions with operations on umbrae. For example multiplication of exponential generating
functions is umbrally represented by the sum of the corresponding umbrae, that is

$$f(\alpha + \gamma, z) = f(\alpha, z)\, f(\gamma, z).$$

Therefore $f(\alpha, z)^2$ is the generating function of $\alpha + \alpha'$, which is different from the generating
function $f(\alpha, 2z)$ of $2\alpha$. The sum of generating functions is represented by the auxiliary
umbra $\alpha \dot{+} \gamma$, named the disjoint sum of two umbrae, that is

$$f(\alpha \dot{+} \gamma, z) = f(\alpha, z) + f(\gamma, z) - 1,$$

so that $E[(\alpha \dot{+} \gamma)^i] = E[\alpha^i] + E[\gamma^i]$ for all positive integers $i$. Then, $2f(\alpha, z) - 1$ is the
generating function of $\alpha \dot{+} \alpha$ or $\alpha \dot{+} \alpha'$, and $\alpha \dot{+} \alpha \equiv \alpha \dot{+} \alpha'$.

It is also possible to compose generating functions and to represent the composition as
the generating function of an umbra. First consider $n$ uncorrelated umbrae $\alpha', \alpha'', \ldots, \alpha'''$
similar to $\alpha$ and take their sum: the resulting umbra $\alpha' + \alpha'' + \cdots + \alpha'''$, denoted by $n.\alpha$,
is called the dot product of the integer $n$ and the umbra $\alpha$. Its generating function is
$f(n.\alpha, z) = [f(\alpha, z)]^n$ and the moments are [3]

$$E[(n.\alpha)^i] = \sum_{\lambda \vdash i} (n)_{l(\lambda)}\, d_\lambda\, a_\lambda \quad \text{with} \quad d_\lambda = \frac{i!}{(1!)^{r_1} (2!)^{r_2} \cdots\, r_1!\, r_2! \cdots}$$   (3.4)

where $\lambda = (1^{r_1}, 2^{r_2}, \ldots)$ is a partition of the integer $i$ into $l(\lambda)$ parts, and
$a_\lambda = a_1^{r_1} a_2^{r_2} \cdots$ is the moment product [8]. The right-hand side of (3.4) corresponds to
$E[(X_1 + \cdots + X_n)^i]$ with $X_1, \ldots, X_n$ iid with moment sequence represented by the umbra $\alpha$.
In (3.4), set $E[(n.\alpha)^i] = q_i(n)$, which is a polynomial of degree $i$ in $n$. If the integer $n$ is
replaced by any umbra $\gamma \in A$, and $(\gamma)_j = \gamma(\gamma - 1) \cdots (\gamma - j + 1)$ denotes the descending
factorial polynomial, then we have $q_i(\gamma) = \sum_{\lambda \vdash i} (\gamma)_{l(\lambda)}\, d_\lambda\, a_\lambda$. The symbol $\gamma.\alpha$ such that
$E[(\gamma.\alpha)^i] = E[q_i(\gamma)]$ is called the dot-product of the umbrae $\alpha$ and $\gamma$. This last equality
could be rewritten by using the umbral equivalence $\simeq$ such that $p \simeq q$ iff $E[p] = E[q]$ with
$p, q \in R[A]$. Then we have $(\gamma.\alpha)^i \simeq q_i(\gamma)$. More generally, the umbral equivalence turns out
to be useful in dealing with umbral polynomials with non-disjoint supports, as we will see later.

The replacement of the integer $n$ with the umbra $\gamma$ is an example of the main device employed
in the symbolic method, allowing us to represent more structured moment sequences starting
from (3.4). Observe that we move from the generating function $[f(\alpha, z)]^n$ to the generating
function $f(\gamma.\alpha, z) = f(\gamma, \log[f(\alpha, z)])$, which is not yet the composition of $f(\alpha, z)$ and
$f(\gamma, z)$. For this purpose, the umbra $\alpha$ in the dot product $\gamma.\alpha$ has to be replaced by a dot
product involving the Bell umbra, that is $\beta.\alpha$. The dot product $\beta.\alpha$ is called the
$\alpha$-partition umbra, with generating function $f(\beta.\alpha, z) = \exp(f(\alpha, z) - 1)$. A special
property which we use later is

$$\beta.(\alpha \dot{+} \gamma) \equiv \beta.\alpha + \beta.\gamma.$$   (3.5)

The symbol $\gamma.(\beta.\alpha)$ has a generating function which is the composition of $f(\alpha, z)$ and $f(\gamma, z)$:

$$f(\gamma.(\beta.\alpha), z) = f(\gamma, f(\alpha, z) - 1).$$   (3.6)

Parentheses can be avoided since $\gamma.(\beta.\alpha) \equiv (\gamma.\beta).\alpha$. The moments are

$$E[(\gamma.\beta.\alpha)^i] = \sum_{\lambda \vdash i} g_{l(\lambda)}\, d_\lambda\, a_\lambda,$$   (3.7)

where $\{g_i\}$ are the moments of the umbra $\gamma$ [8].
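
Formulas (3.4) and (3.7) are sums over integer partitions and can be evaluated mechanically. The sketch below is an illustration with hypothetical function names and an arbitrary test sequence. It checks (3.4) against the moments of a sum of $n$ iid variables obtained by repeated binomial convolution, and evaluates (3.7) for $\gamma = \alpha = u$, where the result should be the Bell numbers of Example 3.1.

```python
from fractions import Fraction
from math import comb, factorial

def partitions(i, largest=None):
    """Yield the partitions of i as non-increasing tuples of parts."""
    if largest is None or largest > i:
        largest = i
    if i == 0:
        yield ()
        return
    for first in range(largest, 0, -1):
        for rest in partitions(i - first, first):
            yield (first,) + rest

def d(lam):
    """d_lambda = i! / prod_j ((j!)^{r_j} r_j!), as in (3.4)."""
    denom = 1
    for part in set(lam):
        r = lam.count(part)
        denom *= factorial(part) ** r * factorial(r)
    return factorial(sum(lam)) // denom

def falling(n, j):
    """Descending factorial (n)_j."""
    out = 1
    for t in range(j):
        out *= n - t
    return out

def moment_product(a, lam):
    out = Fraction(1)
    for part in lam:
        out *= a[part]
    return out

def dot_moment(n, a, i):
    """E[(n.alpha)^i] via (3.4); a[j] is the j-th moment of alpha, a[0] = 1."""
    return sum(falling(n, len(lam)) * d(lam) * moment_product(a, lam)
               for lam in partitions(i))

def iid_sum_moment(n, a, i):
    """i-th moment of X_1 + ... + X_n by repeated binomial convolution."""
    m = [Fraction(1)] + [Fraction(0)] * i  # moments of the empty sum
    for _ in range(n):
        m = [sum(comb(k, j) * m[j] * a[k - j] for j in range(k + 1))
             for k in range(i + 1)]
    return m[i]

def composition_moment(g, a, i):
    """E[(gamma.beta.alpha)^i] via (3.7); g[l] are the moments of gamma."""
    return sum(g[len(lam)] * d(lam) * moment_product(a, lam)
               for lam in partitions(i))

a = [Fraction(v) for v in (1, 1, 3, 2, 7)]  # arbitrary moment sequence
ones = [Fraction(1)] * 6                    # moments of the unity umbra u
bell = [composition_moment(ones, ones, i) for i in range(1, 6)]
```

With all moments equal to one, (3.7) reduces to the sum of $d_\lambda$ over the partitions of $i$, which counts the set partitions of $\{1, \ldots, i\}$.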

Example 3.3. The composition umbra arises naturally in connection with random sums
$X_1 + \cdots + X_N$, where the $X$s are iid, and $N$ is distributed independently of $X$. The
cumulant generating function of the sum is the composition $K_N(K_X(t))$ of the two generating
functions. In probability theory, $N$ is necessarily integer-valued, but there is no such
constraint on the umbra $\gamma$.

Strictly connected to the composition umbra is the compositional inverse umbra $\alpha^{\langle -1 \rangle}$
of an umbra $\alpha$, such that

$$\alpha^{\langle -1 \rangle}.\beta.\alpha \equiv \chi \equiv \alpha.\beta.\alpha^{\langle -1 \rangle}.$$   (3.8)

A special compositional inverse umbra is $u^{\langle -1 \rangle}$, with $u$ the unity umbra, having generating
function

$$f(u^{\langle -1 \rangle}, z) = 1 + \log(1 + z)$$   (3.9)

so that its $i$-th moment is

$$E[(u^{\langle -1 \rangle})^i] = (-1)^{i-1} (i - 1)!.$$   (3.10)

Multivariate case. Let $\{\nu_1, \ldots, \nu_m\}$ be a set of umbral monomials with support not
necessarily disjoint. A vector sequence $\{g_i\}_{i \in \mathbb{N}_0^m} \in R$, with $g_i = g_{i_1, \ldots, i_m}$ and $g_0 = 1$, is
represented by the $m$-tuple $\nu = (\nu_1, \ldots, \nu_m)$ if

$$E[\nu^i] = g_i,$$   (3.11)

for each multi-index $i \in \mathbb{N}_0^m$. The elements $\{g_i\}_{i \in \mathbb{N}_0^m}$ in (3.11) are called multivariate
moments of $\nu$.

Remark 3.4. Within random variables, the $m$-tuple $\nu = (\nu_1, \ldots, \nu_m)$ corresponds to a
random vector $(X_1, \ldots, X_m)$. If $\{\nu_i\}_{i=1}^m$ are uncorrelated umbrae then $g_i = E[\nu_1^{i_1}] \cdots E[\nu_m^{i_m}]$
and we recover the univariate symbolic method. The same happens if $\{\nu_i\}_{i=1}^m$ are umbral
monomials with disjoint supports.

As done in (3.2), the generating function of $\nu$ is the formal power series

$$e^{\nu_1 z_1 + \cdots + \nu_m z_m} = u + \sum_{k \ge 1} \sum_{|i| = k} \nu^i \frac{z^i}{i!}$$   (3.12)

with $z = (z_1, \ldots, z_m)$, $|i| = i_1 + \cdots + i_m$ and $i! = i_1! \cdots i_m!$. If the sequence $\{g_i\}$ is umbrally
represented by $\nu$ and has (exponential) generating function

$$f(z) = 1 + \sum_{k \ge 1} \sum_{|i| = k} g_i \frac{z^i}{i!}$$   (3.13)

then $E[e^{\nu_1 z_1 + \cdots + \nu_m z_m}] = f(z)$. Taking into account (3.11), the generating function in (3.13)
is denoted by $f(\nu, z)$. Two umbral vectors $\nu_1$ and $\nu_2$ are said to be similar, in symbols
$\nu_1 \equiv \nu_2$, iff $f(\nu_1, z) = f(\nu_2, z)$, that is $E[\nu_1^i] = E[\nu_2^i]$ for all $i \in \mathbb{N}_0^m$. They are said to be
uncorrelated iff $E[\nu_1^i \nu_2^j] = E[\nu_1^i]\, E[\nu_2^j]$ for all $i, j \in \mathbb{N}_0^m$.

An equation analogous to (3.4) could be given for the multivariate case, provided that
integer partitions are replaced with multi-index partitions [1]. A partition of a multi-index
$i$ is a composition $\lambda$, whose columns are in lexicographic order, in symbols $\lambda \vdash i$. A
composition $\lambda$ of a multi-index $i$ is a matrix $\lambda = (\lambda_{ij})$ of nonnegative integers and with
no zero columns such that $\lambda_{r1} + \lambda_{r2} + \cdots + \lambda_{rk} = i_r$ for $r = 1, 2, \ldots, n$. The number
of columns of $\lambda$ is the length of $\lambda$ and denoted by $l(\lambda)$. As for integer partitions, the
notation $\lambda = (\lambda_1^{r_1}, \lambda_2^{r_2}, \ldots)$ means that in the matrix $\lambda$ there are $r_1$ columns equal to $\lambda_1$,
$r_2$ columns equal to $\lambda_2$ and so on, with $\lambda_1 < \lambda_2 < \cdots$. We set $m(\lambda) = (r_1, r_2, \ldots)$. The
dot-product $n.\nu$ of a nonnegative integer $n$ and an $m$-tuple $\nu$ is an auxiliary umbra denoting
the summation $\nu' + \nu'' + \cdots + \nu'''$ with $\{\nu', \nu'', \ldots, \nu'''\}$ a set of $n$ uncorrelated and similar
$m$-tuples. For $i \in \mathbb{N}_0^m$ and $m$-tuples $\nu$ of umbral monomials, we have

$$E[(n.\nu)^i] = \sum_{\lambda \vdash i} \frac{i!}{m(\lambda)!\, \lambda!}\, (n)_{l(\lambda)}\, g_\lambda$$   (3.14)

where the sum is over all partitions $\lambda = (\lambda_1^{r_1}, \lambda_2^{r_2}, \ldots)$ of the multi-index $i$,
$g_\lambda = g_{\lambda_1}^{r_1} g_{\lambda_2}^{r_2} \cdots$ and $g_{\lambda_i} = E[\nu^{\lambda_i}]$. The sequence in (3.14) gives the moments of a sum of iid
random vectors with sequence of moments $\{g_i\}$. If we replace the integer $n$ in (3.14) with the
dot-product $\alpha.\beta$ we get the auxiliary umbra $\alpha.\beta.\nu$ representing the sequence of moments

$$E[(\alpha.\beta.\nu)^i] = \sum_{\lambda \vdash i} \frac{i!}{m(\lambda)!\, \lambda!}\, a_{l(\lambda)}\, g_\lambda,$$   (3.15)

where the sequence $\{a_i\}$ is umbrally represented by $\alpha$. In particular the generating function
of the auxiliary umbra $\alpha.\beta.\nu$ turns out to be the composition of the univariate generating
function $f(\alpha, z)$ and the multivariate generating function $f(\nu, z)$:

$$f(\alpha.\beta.\nu, z) = f[\alpha, f(\nu, z) - 1].$$   (3.16)

From Example 3.3, the umbra $\alpha.\beta.\nu$ is a generalization of a multivariate compound
randomized Poisson random vector. In the next section, we show how this umbra allows us to
write a formula for multivariate cumulants involving multi-index partitions. More details
on the symbolic composition of multivariate formal power series can be found in [5].



4 Formal cumulants 



4.1 Definition 



Among the sequences of numbers related to a real-valued random variable, cumulants play
a central role. Whether or not the sequence $\{a_i\}$ corresponds to the moments of some
distribution, we define cumulants $\{c_i\}$ by the following equation:

$$1 + \sum_{i \ge 1} a_i \frac{z^i}{i!} = \exp\Big( \sum_{i \ge 1} c_i \frac{z^i}{i!} \Big).$$   (4.1)

If $\alpha$ is an umbra representing the sequence $\{a_i\}$ and $\kappa_\alpha$ is an umbra representing the
sequence $\{c_i\}$, then by comparing (4.1) with (3.6) we have

$$\alpha \equiv u.\beta.\kappa_\alpha,$$   (4.2)

since $f(u, z) = \exp(z)$. The umbra $\kappa_\alpha$ is called the $\alpha$-cumulant umbra [3] and is such that

$$f(\kappa_\alpha, z) = 1 + \log(f(\alpha, z)).$$   (4.3)

By comparing (4.3) with (3.6) and (3.9), we have

$$\kappa_\alpha \equiv u^{\langle -1 \rangle}.\beta.\alpha.$$   (4.4)

Since $u^{\langle -1 \rangle}.\beta.u \equiv u^{\langle -1 \rangle}.\beta \equiv \chi$, equivalence (4.4) reduces to

$$\kappa_\alpha \equiv \chi.\alpha.$$

The algebraic properties of cumulants can be formalized as

(Homogeneity)  $\chi.(a\alpha) \equiv a(\chi.\alpha)$, if $a \in R$,
(Additivity)   $\chi.(\alpha + \gamma) \equiv \chi.\alpha \dot{+} \chi.\gamma$.          (4.5)

The semi-invariance under translation follows from both equivalences in (4.5) by choosing
as umbra $\alpha$ the unity umbra $u$.
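
Definition (4.1) and the properties (4.5) can be checked numerically on moment sequences. The sketch below is an illustration, not the symbolic method of the text: it uses the recursion $a_i = \sum_{k=1}^{i} \binom{i-1}{k-1} c_k a_{i-k}$, a standard identity obtained by differentiating (4.1), and all function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def cumulants_from_moments(a):
    """Given moments a[0..n] with a[0] = 1, return c[0..n] (c[0] unused)
    such that 1 + sum a_i z^i/i! = exp(sum c_i z^i/i!), as in (4.1)."""
    n = len(a) - 1
    c = [Fraction(0)] * (n + 1)
    for i in range(1, n + 1):
        c[i] = a[i] - sum(comb(i - 1, k - 1) * c[k] * a[i - k]
                          for k in range(1, i))
    return c

def sum_moments(a, b):
    """Moments of alpha + gamma for uncorrelated umbrae (binomial convolution)."""
    n = min(len(a), len(b)) - 1
    return [sum(comb(i, j) * a[j] * b[i - j] for j in range(i + 1))
            for i in range(n + 1)]

def scaled_moments(s, a):
    """Moments of the umbra s * alpha."""
    return [Fraction(s) ** i * a[i] for i in range(len(a))]

bell = [Fraction(v) for v in (1, 1, 2, 5, 15, 52)]      # unit Poisson moments
singleton = [Fraction(v) for v in (1, 1, 0, 0, 0)]      # moments of chi
a = [Fraction(v) for v in (1, 2, 5, 3, 11)]             # arbitrary sequences
b = [Fraction(v) for v in (1, -1, 4, 0, 6)]

c_a = cumulants_from_moments(a)
c_b = cumulants_from_moments(b)
c_sum = cumulants_from_moments(sum_moments(a, b))
c_scaled = cumulants_from_moments(scaled_moments(3, a))
```

The cumulants of the Bell sequence are all equal to one, those of the singleton sequence reproduce $(-1)^{i-1}(i - 1)!$ as in (3.10), and `c_sum` and `c_scaled` illustrate additivity and homogeneity.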

As done in (4.1), multivariate formal cumulants $\{c_i\}$ of a sequence of multivariate moments
$\{g_i\}$ can be defined via generating functions. Indeed, if $\{g_i\}$ is umbrally represented
by the $m$-tuple $\nu$, then the sequence $\{c_i\}$ is umbrally represented by the $m$-tuple $\kappa_\nu$ such
that

$$f(\nu, z) = \exp[f(\kappa_\nu, z) - 1].$$   (4.6)

The $m$-tuple $\kappa_\nu$ is named $\nu$-cumulant. By comparing (3.16) with (4.6), the following
equivalence follows

$$\nu \equiv u.\beta.\kappa_\nu.$$   (4.7)

Equivalence (4.7) can be inverted in

$$\kappa_\nu \equiv u^{\langle -1 \rangle}.\beta.\nu, \quad \text{with} \quad f(\kappa_\nu, z) = 1 + \log[f(\nu, z)]$$   (4.8)

where $u^{\langle -1 \rangle}$ is the compositional inverse of the unity umbra $u$. Moments of $u^{\langle -1 \rangle}.\beta.\nu$ can
be computed via equation (3.15) by recalling (3.10). As done before, the umbra $u^{\langle -1 \rangle}.\beta$
may be replaced by the umbra $\chi$, so that

$$\kappa_\nu \equiv \chi.\nu.$$   (4.9)

Thanks to this last representation, the algebraic properties of cumulants can be formalized
as

(Homogeneity)  $\chi.(a\nu) \equiv a(\chi.\nu)$, if $a \in R$,
(Additivity)   $\chi.(\nu_1 + \nu_2) \equiv \chi.\nu_1 \dot{+} \chi.\nu_2$, if $\nu_1$ and $\nu_2$ are uncorrelated $m$-tuples.   (4.10)

In the additivity property (4.10), we have used the disjoint sum of two $m$-tuples, that is
$E[(\nu_1 \dot{+} \nu_2)^i] = E[\nu_1^i] + E[\nu_2^i]$ for all $i \in \mathbb{N}_0^m$. The semi-invariance under translation
follows from both equivalences in (4.10) by choosing the $m$-tuple $u = (u, \ldots, u)$ as $\nu$.

4.2 Cumulants of trace powers

Let us represent the eigenvalues of a random matrix $M$ of dimension $m$ by the $m$-tuple of
umbral monomials $\nu = (\nu_1, \ldots, \nu_m)$. Cumulants of $\nu$ can be recovered via (4.7) and (4.8).
In this section we will characterize cumulants of $\mathrm{Tr}(M)$, that is cumulants of the sequence
$E[(\nu_1 + \cdots + \nu_m)^k]$ for $k = 1, 2, \ldots$. Observe that

$$f(\nu_1 + \cdots + \nu_m, z) = E[e^{(\nu_1 + \cdots + \nu_m) z}] = f(\nu, \mathbf{z}),$$   (4.11)

by using (3.12) and (3.13) with $\mathbf{z} = (z, \ldots, z)$. Compositions of multivariate formal power
series like $f(\nu, \mathbf{z})$ in (4.11) are represented by symbols with a peculiar expression. Indeed, if
$\nu$ is an $m$-tuple of umbral monomials with generating function $f(\nu, \mathbf{z})$ and $\xi$ is an $m$-tuple of
umbral monomials with generating function $f(\xi, \mathbf{z})$, the $m$-tuple having generating function
$f[\xi, (f(\nu, \mathbf{z}) - 1, \ldots, f(\nu, \mathbf{z}) - 1)]$ is $(\xi_1 + \cdots + \xi_m).\beta.\nu$, that is

$$f[(\xi_1 + \cdots + \xi_m).\beta.\nu, \mathbf{z}] = f[\xi, (f(\nu, \mathbf{z}) - 1, \ldots, f(\nu, \mathbf{z}) - 1)].$$   (4.12)

As done in (4.2) and (4.4) for univariate and multivariate cumulants respectively, in order
to characterize cumulants of $f(\nu, \mathbf{z})$ in (4.11), we replace the $m$-tuple $\xi$ with the $m$-tuple
$u = (u, \ldots, u)$ in (4.12). Denote by $\mathfrak{c}_\nu$ the $m$-tuple such that $\nu \equiv (u + \cdots + u).\beta.\mathfrak{c}_\nu$, that
is

$$f(\nu, \mathbf{z}) = f[(u + \cdots + u).\beta.\mathfrak{c}_\nu, \mathbf{z}]$$   (4.13)

with $f(\nu, \mathbf{z})$ in (4.11). If $\mathfrak{c}_\nu = (\mathfrak{c}_{1,\nu}, \ldots, \mathfrak{c}_{m,\nu})$ then $f(\mathfrak{c}_\nu, \mathbf{z}) = f[\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu}, z]$ and

$$f[(u + \cdots + u).\beta.\mathfrak{c}_\nu, \mathbf{z}] = \exp\{m\,(f[\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu}, z] - 1)\}.$$

Definition 4.1. For fixed $m$, formal cumulants of the sequence $\{E[(\nu_1 + \cdots + \nu_m)^k]\}$ are
umbrally represented by the umbral polynomial $\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu}$, such that

$$\nu_1 + \cdots + \nu_m \equiv m.\beta.(\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu}),$$   (4.14)

with $\mathfrak{c}_\nu = (\mathfrak{c}_{1,\nu}, \ldots, \mathfrak{c}_{m,\nu})$ given in (4.13).

In order to prove that the moments of the umbral polynomial $\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu}$ satisfy
the characterizing algebraic properties of cumulants, we need to invert (4.14).

Proposition 4.2. We have $\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu} \equiv \chi.\frac{1}{m}.(\nu_1 + \cdots + \nu_m)$.

Proof. Indeed from (4.14), we have $\frac{1}{m}.(\nu_1 + \cdots + \nu_m) \equiv \beta.(\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu})$, so
$\chi.\frac{1}{m}.(\nu_1 + \cdots + \nu_m) \equiv \chi.\beta.(\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu})$. The result follows since $\chi.\beta \equiv u$. □

Thanks to Proposition 4.2, the umbral polynomial $\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu}$ is similar to an umbra
like $\chi.p$, with $p \in R[A]$. So it has to satisfy the additivity and homogeneity properties like
those in (4.10).

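Theorem 4.3.
Additivity: If $\nu_1$ and $\nu_2$ are uncorrelated $m$-tuples, then

$$\mathfrak{c}_{1,(\nu_1 + \nu_2)} + \cdots + \mathfrak{c}_{m,(\nu_1 + \nu_2)} \equiv (\mathfrak{c}_{1,\nu_1} + \cdots + \mathfrak{c}_{m,\nu_1}) \dot{+} (\mathfrak{c}_{1,\nu_2} + \cdots + \mathfrak{c}_{m,\nu_2}).$$

Homogeneity: if $a \in R$, then $\mathfrak{c}_{1,(a\nu)} + \cdots + \mathfrak{c}_{m,(a\nu)} \equiv a\,(\mathfrak{c}_{1,\nu} + \cdots + \mathfrak{c}_{m,\nu})$.

Proof. Observe that $a.(p + q) \equiv a.p + a.q$ if $a \in R$ and $p, q$ are umbral polynomials
with disjoint supports [3]. The previous equivalence holds in particular for $a = 1/m$ and
$p = (\nu_{11} + \cdots + \nu_{1m})$ and $q = (\nu_{21} + \cdots + \nu_{2m})$. From Proposition 4.2,

$$\mathfrak{c}_{1,(\nu_1 + \nu_2)} + \cdots + \mathfrak{c}_{m,(\nu_1 + \nu_2)} \equiv \chi.\frac{1}{m}.(p + q) \equiv \chi.\Big(\frac{1}{m}.p + \frac{1}{m}.q\Big).$$

From the additivity property in (4.10) we have

$$\chi.\Big(\frac{1}{m}.p + \frac{1}{m}.q\Big) \equiv \chi.\frac{1}{m}.p \,\dot{+}\, \chi.\frac{1}{m}.q$$

for $p$ and $q$ umbral polynomials with disjoint supports. The result follows by observing
that $\mathfrak{c}_{1,\nu_1} + \cdots + \mathfrak{c}_{m,\nu_1} \equiv \chi.\frac{1}{m}.p$ and $\mathfrak{c}_{1,\nu_2} + \cdots + \mathfrak{c}_{m,\nu_2} \equiv \chi.\frac{1}{m}.q$. The homogeneity property
follows since $b.(ap) \equiv a(b.p)$, for $a, b \in R$ and $p \in R[A]$ [3]. The previous equivalence holds
in particular for $b = 1/m$ and $p = (\nu_1 + \cdots + \nu_m)$. □

The semi-invariance under translation follows since $(\mathfrak{c}_{1,u} + \cdots + \mathfrak{c}_{m,u}) \equiv \chi.\frac{1}{m}.m \equiv \chi$,
whose moments are all zero except the first. The connection between multivariate cumulants
of $\nu$ and those of $\nu_1 + \cdots + \nu_m$ is given in the following proposition.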

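Proposition 4.4. We have $\kappa_\nu \equiv \chi.m.\beta.\mathfrak{c}_\nu$.

Proof. We have $(u + \cdots + u).\beta.\mathfrak{c}_\nu \equiv m.\beta.\mathfrak{c}_\nu$ and from (4.13) we have $m.\beta.\mathfrak{c}_\nu \equiv \nu$. Therefore
from (4.7), we have $u.\beta.\kappa_\nu \equiv m.\beta.\mathfrak{c}_\nu$, so that $\chi.u.\beta.\kappa_\nu \equiv \chi.m.\beta.\mathfrak{c}_\nu$. The result follows
since $\chi.u.\beta \equiv u$ and $\chi.u.\beta.\kappa_\nu \equiv \kappa_\nu$. □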



5 Spectral k-statistics

Tukey [21] introduced the multi-index k-statistics in connection with finite-population
sampling. He showed that the multi-index $k$s are multiplicative in the limit as $n \to \infty$, and
that they are equal to the product of Fisher's single-index $k$s. In the ordinary iid setting
considered by Fisher, this means that each multi-index $k$ converges to a cumulant product.

We now construct matricial polykays, indexed by an integer partition $\lambda$, as unbiased
estimators of cumulant products of trace powers of a random matrix $Y$. Then, when the
random matrix $Y$ is defined by sub-sampling as in section 2, that is when a spectral sample
is considered, we will prove the inheritance property by assuming that the elements of the
diagonal matrix $X$ are umbrally represented by similar and uncorrelated umbrae.

Definition 5.1. The matricial polykay $\mathfrak{K}_\lambda(y)$ of class $\lambda \vdash i$ is the symmetric polynomial in
the eigenvalues $y = (y_1, \ldots, y_m)$ such that

$$E[\mathfrak{K}_\lambda(y)] = \prod_{j=1}^{l(\lambda)} E[(\mathfrak{c}_{1,y} + \cdots + \mathfrak{c}_{m,y})^{\lambda_j}].$$

Set $l(\lambda) = r$. From Proposition 4.2 and equation (3.14), with $n$ replaced by the umbra
$\chi$, a first expression of cumulant products of trace powers in terms of moments of $Y$ is

n ma* +■■■+ e n ^ ( v * - i)! (5 - i} 

3=1 (?yihAi,...,7y r |-A r ) j=l 



where r]i + • • • + T] r = (ti,t 2 ,...) is the summation of the partitions {rjx, . . . ,T] r } and 
g Vl +- +Vr = U l j=i + '" +Vr) E[Tr(Y^)}. Equation takes into account that E[(x.xY] = 

(— l) i_1 (i — 1)! for all nonnegative integers i. 

A second expression which is more suitable for spectral sampling is in terms of joint
moments of $Y$, that is in terms of products of its trace powers. To this end, we need to
work with permutations $\mathfrak{S}_i$ and with the group algebra $R[A](\mathfrak{S}_i)$ on the polynomial ring
$R[A]$.

A permutation $\sigma$ of $[i]$, or $\sigma \in \mathfrak{S}_i$ the symmetric group, can be decomposed into disjoint
cycles $C(\sigma)$. In the standard representation each cycle is written with its largest element
first, and the cycles are listed in increasing order of their largest element [17]. The length of
the cycle $c \in C(\sigma)$ is its cardinality, denoted by $l(c)$. The number of cycles of $\sigma$ is denoted
by $|C(\sigma)|$. Recall that a permutation $\sigma$ with $r_1$ 1-cycles, $r_2$ 2-cycles and so on is said to
be of cycle class $\lambda = (1^{r_1}, 2^{r_2}, \ldots) \vdash i$. In particular we have $l(\lambda) = |C(\sigma)|$. The number of
permutations $\sigma \in \mathfrak{S}_i$ of cycle class $\lambda = (1^{r_1}, 2^{r_2}, \ldots) \vdash i$ is usually denoted by

$$s_\lambda = \frac{i!}{1^{r_1} r_1!\, 2^{r_2} r_2! \cdots}.$$   (5.2)

Each cycle class is a conjugacy class of the group of permutations: two elements of $\mathfrak{S}_i$ are
conjugate if and only if they have the same cycle class.
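
The count (5.2) can be confirmed by enumerating a small symmetric group. The following sketch (with hypothetical function names) lists the permutations of four elements, groups them by cycle class, and compares the class sizes with $s_\lambda$.

```python
from collections import Counter
from itertools import permutations
from math import factorial

def cycle_type(perm):
    """Cycle lengths, in non-increasing order, of a permutation given as a
    tuple with perm[j] = image of j."""
    seen, lengths = set(), []
    for start in range(len(perm)):
        length, j = 0, start
        while j not in seen:
            seen.add(j)
            j = perm[j]
            length += 1
        if length:
            lengths.append(length)
    return tuple(sorted(lengths, reverse=True))

def s(lam):
    """s_lambda = i! / prod_j (j^{r_j} r_j!), as in (5.2)."""
    denom = 1
    for part in set(lam):
        r = lam.count(part)
        denom *= part ** r * factorial(r)
    return factorial(sum(lam)) // denom

# Class sizes in the symmetric group on 4 elements
counts = Counter(cycle_type(p) for p in permutations(range(4)))
```

For four elements the five cycle classes have sizes 6, 8, 3, 6 and 1, which add up to $4! = 24$.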

Consider the group algebra $A_i = R[A](\mathfrak{S}_i)$. An element $f \in A_i$ associates with each
permutation $\sigma \in \mathfrak{S}_i$ a polynomial $f(\sigma) \in R[A]$, so $A_i$ is the space of $R[A]$-valued functions.
Multiplication in $A_i$ is the convolution:

$$(f \cdot g)(\sigma) = \sum_{\rho\omega = \sigma} f(\rho)\, g(\omega).$$   (5.3)

The unitary element with respect to multiplication is the indicator function $\delta$ such that
$\delta(e) = 1$, with $e$ the identity $[i] \to [i]$, and zero otherwise. Indeed $f \cdot \delta = \delta \cdot f = f$ for
$f \in A_i$. If it exists, the inverse function of $f$ in $A_i$ is denoted by $f^{(-1)}$ and is such that

$$f^{(-1)} \cdot f = f \cdot f^{(-1)} = \delta.$$

Denote by $\mu(Y)$ the function in $A_i$ such that

$$\mu(Y)(\sigma) = \prod_{c \in C(\sigma)} \mathrm{Tr}\big(Y^{l(c)}\big) \in R[A]$$   (5.4)

for a matrix $Y$ of order $m$ and $\sigma \in \mathfrak{S}_i$. Evidently $\mu(Y)(\sigma)$ is a product of power sums in
the eigenvalues of $Y$, depending only on the cycle structure. In particular we have

$$\mu(Y)(e) = [\mathrm{Tr}(Y)]^i \quad \text{and} \quad \mu(I_m)(\sigma) = m^{|C(\sigma)|}.$$   (5.5)

Theorem 5.2. Define the function $\kappa(y) \in A_i$ by

$$\kappa(y) = \mu(I_m)^{(-1)} \cdot \mu(Y)$$   (5.6)

with $\mu(Y)$ and $\mu(I_m)$ given in (5.4) and (5.5) respectively. Then

$$\mathfrak{K}_\lambda(y) \simeq \big[(1!)^{r_2} (2!)^{r_3} \cdots\big]\, \kappa(y)(\sigma)$$   (5.7)

is a matricial polykay of class $\lambda = (1^{r_1}, 2^{r_2}, \ldots) \vdash i$, the cycle structure of $\sigma$.
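
For small degree $i$, the statistic in (5.6) and (5.7) can be computed directly from the eigenvalues by inverting $\mu(I_m)$ in the group algebra. The sketch below is a brute-force illustration under stated assumptions: the inverse is obtained by solving a linear system of size $i!$ in exact arithmetic, which requires $i \le m$, and all function names and the eigenvalues `y` are hypothetical.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def cycle_lengths(perm):
    """Cycle lengths (non-increasing) of perm, with perm[j] = image of j."""
    seen, lengths = set(), []
    for start in range(len(perm)):
        length, j = 0, start
        while j not in seen:
            seen.add(j)
            j = perm[j]
            length += 1
        if length:
            lengths.append(length)
    return sorted(lengths, reverse=True)

def inverse(perm):
    inv = [0] * len(perm)
    for j, image in enumerate(perm):
        inv[image] = j
    return tuple(inv)

def compose(p, q):
    """(p q)(j) = p(q(j))."""
    return tuple(p[v] for v in q)

def solve(M, b):
    """Gauss-Jordan elimination over the rationals (M must be invertible)."""
    n = len(b)
    A = [row[:] + [rhs] for row, rhs in zip(M, b)]
    for col in range(n):
        piv = next(r for r in range(col, n) if A[r][col] != 0)
        A[col], A[piv] = A[piv], A[col]
        A[col] = [v / A[col][col] for v in A[col]]
        for r in range(n):
            if r != col and A[r][col] != 0:
                factor = A[r][col]
                A[r] = [vr - factor * vc for vr, vc in zip(A[r], A[col])]
    return [A[r][n] for r in range(n)]

def kappa(y, i):
    """Return {sigma: kappa(y)(sigma)} over the permutations of i elements."""
    m = len(y)
    group = list(permutations(range(i)))
    identity = tuple(range(i))

    def mu_Y(s):  # product of traces Tr(Y^l(c)) over the cycles of s, as in (5.4)
        out = Fraction(1)
        for l in cycle_lengths(s):
            out *= sum(Fraction(t) ** l for t in y)
        return out

    def mu_I(s):  # m^{|C(s)|}, as in (5.5)
        return Fraction(m) ** len(cycle_lengths(s))

    # w is the convolution inverse of mu_I: sum_rho w(rho) mu_I(rho^-1 sigma) = delta(sigma)
    M = [[mu_I(compose(inverse(r), s)) for r in group] for s in group]
    delta = [Fraction(int(s == identity)) for s in group]
    w = dict(zip(group, solve(M, delta)))
    return {s: sum(w[r] * mu_Y(compose(inverse(r), s)) for r in group) for s in group}

def spectral_polykay(lam, y):
    """Right-hand side of (5.7) for a permutation of cycle class lam."""
    i = sum(lam)
    target = sorted(lam, reverse=True)
    sigma = next(s for s in permutations(range(i)) if cycle_lengths(s) == target)
    scale = 1
    for part in lam:
        scale *= factorial(part - 1)  # equals (1!)^{r_2} (2!)^{r_3} ...
    return scale * kappa(y, i)[sigma]

y = [1, 3, 4, 9]  # hypothetical eigenvalues, m = 4
```

For these eigenvalues, `spectral_polykay((1,), y)` returns the eigenvalue average and `spectral_polykay((2,), y)` returns the sample variance divided by $m + 1$, matching the two examples quoted in Section 1.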

Proof. Observe that by taking the expectation of both sides in (5.6) we have

$$E[\kappa(y)] = \mu(I_m)^{-1} * E[\mu(Y)],$$   (5.8)

where $*$ is the classical convolution on the space of $R$-valued functions on $\mathfrak{S}_i$,

$$(a * b)(\sigma) = \sum_{\rho\omega = \sigma} a(\rho)\, b(\omega),$$

with $E[\mu(Y)] : \sigma \in \mathfrak{S}_i \mapsto E[\mu(Y)(\sigma)] \in R$ and $E[\kappa(y)] : \sigma \in \mathfrak{S}_i \mapsto E[\kappa(y)(\sigma)] \in R$. Then
the symmetric polynomial $\mathfrak{K}_\lambda(y)$ in (5.7) satisfies Definition 5.1 if also the function

E[C V ) : a G 6< H. J] n j J5[(ci, y + --- + c m , y )^] 

is such that £?[C y ] = fi(I m )~ l * i£[//(y)]. From (Oil) , we have 

AH i=l 

From (5.9), by observing that

$$d_\lambda = \frac{s_\lambda}{(1!)^{r_2} (2!)^{r_3} \cdots}$$   (5.10)

with $s_\lambda$ the number of permutations $\sigma \in \mathfrak{S}_i$ of cycle class $\lambda = (1^{r_1}, 2^{r_2}, \ldots) \vdash i \le m$ given
in (5.2), we have

= £ m^WI^^Ka) = £ m^CJ^ 1 ), (5.11) 

where $\sigma^{-1}$ is the inverse of $\sigma$. By using the action of $\mathfrak{S}_i$ on the group algebra $R(\mathfrak{S}_i)$ we
have $E[\mu(Y)] = \mu(I_m) * E[C_y]$. For $i \le m$ the function $\sigma \mapsto m^{|C(\sigma)|}$ has an inverse [TU], so
that $E[C_y] = \mu(I_m)^{-1} * E[\mu(Y)]$. □

Note that equation (5.8) is given in pQ as the definition of the cumulants of a random
matrix. By Theorem 5.2 we have shown that $\kappa(y)$ in (5.8) are rather statistics, due also
to the condition $i \le m$ which parallels the analogous condition for Fisher's k-statistics.

The statistics $\mathfrak{K}_\lambda(y)$ are unbiased estimators of products of cumulants, due to Definition
5.1. The inheritance on the average is indeed strictly connected to the spectral sampling,
that is to the special structure of the matrix $Y = H_{[m \times n]} X H^\dagger_{[m \times n]}$. When $\mathfrak{K}_\lambda(y)$ refers to
spectral sampling, we call them spectral polykays.

Definition 5.3 (Natural spectral statistics). A statistic $T$ is said to be natural relative to
spectral sampling if, for each $m \le n$, the average value of $T_m$ over spectral sub-samples $y$
of $x$ is equal to $T_n(x)$:

$E(T_m(y) \mid x) = T_n(x)$. (5.12)

Theorem 5.5 states that the spectral k-statistics $\mathfrak{K}_\lambda(y)$ are natural.
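The naturality property (5.12) can be probed numerically. The following is a minimal Monte Carlo sketch, not part of the original derivation: it assumes that a spectral sample is the spectrum of $Y = H X H^\dagger$ with $H$ the first $m$ rows of a Haar-distributed unitary matrix, and it uses the degree-2 statistic $(nS_2 - S_1^2)/(n(n^2-1))$. The function names (`haar_unitary`, `spectral_sample`, `K2`), the population `x`, the seed and the number of replications are illustrative choices only.

```python
import numpy as np

def haar_unitary(n, rng):
    """Draw an n x n Haar unitary via QR of a complex Gaussian matrix."""
    z = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
    q, r = np.linalg.qr(z)
    d = np.diag(r)
    return q * (d / np.abs(d))  # rescale column phases so the law is Haar

def spectral_sample(x, m, rng):
    """Eigenvalues of Y = H X H^dagger, with H the first m rows of a Haar unitary."""
    h = haar_unitary(len(x), rng)[:m, :]
    y = h @ np.diag(x) @ h.conj().T
    return np.linalg.eigvalsh(y)

def K2(v):
    """Degree-2 single-index statistic (n S_2 - S_1^2) / (n (n^2 - 1))."""
    n, s1, s2 = len(v), np.sum(v), np.sum(v ** 2)
    return (n * s2 - s1 ** 2) / (n * (n ** 2 - 1))

rng = np.random.default_rng(0)          # assumed seed, for reproducibility only
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])  # assumed population of n = 5 values
m = 3                                    # sample size, 2 <= m <= n
avg = np.mean([K2(spectral_sample(x, m, rng)) for _ in range(5000)])
print(avg, K2(x))  # the two numbers should agree up to Monte Carlo error
```

The agreement is only approximate, since the average is taken over a finite number of simulated samples.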

We first give a proposition which builds on Lemma 7.2 in [3]. In this proposition, the
evaluation operator $E[\,\cdot \mid \gamma]$ treats the elements of the n-tuple $\gamma$ as if they were constants. A
formal definition of $E[\,\cdot \mid \gamma]$ may be found in [6].

Proposition 5.4. If $\{\gamma_1, \gamma_2, \ldots, \gamma_n\}$ are uncorrelated umbrae similar to the umbra $\gamma$, then
E[( ll z 1 + l2 z 2 + --- + ln z n y\-f] = ^ d ^h) E [^} (5-13) 

where $\gamma = (\gamma_1, \gamma_2, \ldots, \gamma_n)$ and $\sigma$ is the polynomial umbra whose moments are the power
sums in the indeterminates $\{z_1, z_2, \ldots, z_n\}$, that is $E[\sigma^i] = z_1^i + z_2^i + \cdots + z_n^i$.

Proof. The starting point is the result of Lemma 7.2 in [3] 

X>(7i*l H h InZn) = (x-l)v (5-14) 

where $\{\gamma_1, \gamma_2, \ldots, \gamma_n\}$ are uncorrelated umbrae similar to $\gamma$. Equivalence (5.14) may be
rewritten as

X-iliZi H h 1nZ n ) = I X---{li + 72 H h 7n) I cr, (5.15) 



(5.16) 



as 7 = -.n.7. Taking the dot-product with (3 on both sides of (" 15 . 15[) gives 



(71^1 H h lnZ n ) = 0- 



X---(Ti +72 H h7n) ) a 

n 



The result follows by using (3.7) with $\gamma$ replaced by the unity umbra $u$ and by using
Proposition 4.2 and Definition 5.1. In equation (3.7), the evaluation operator is intended
to be replaced by $E[\,\cdot \mid \gamma]$. □



Theorem 5.5 (Inheritance on the average). The statistics $\mathfrak{K}_\lambda(y)$ in (5.1) are inherited on
the average, that is

$E[\mathfrak{K}_\lambda(y) \mid x] = \mathfrak{K}_\lambda(x)$

where $y$ is a spectral random sample.
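Proof. Since the trace is invariant under cyclic permutations, for a nonnegative integer $i$
we have

$\mathrm{Tr}(Y^i) = \mathrm{Tr}\big[(H_{[m \times n]} X H_{[m \times n]}^\dagger)^i\big] = \mathrm{Tr}\big[(X H_{[m \times n]}^\dagger H_{[m \times n]})^i\big]$

with $Y = H_{[m \times n]} X H_{[m \times n]}^\dagger$ the spectral sample matrix. Therefore $\mu(Y) = \mu(XB)$, with $B = H_{[m \times n]}^\dagger H_{[m \times n]}$ a square
matrix of dimension $n$ independent of $X$. The random matrix $B$ is an orthogonal projection
on an m-dimensional subspace such that

$\mu(B)(\sigma) = \prod_{c \in C(\sigma)} \mathrm{Tr}\big(B^{l(c)}\big) = \prod_{c \in C(\sigma)} \mathrm{Tr}\big[(H_{[m \times n]} H_{[m \times n]}^\dagger)^{l(c)}\big] = \mu(I_m)(\sigma)$. (5.17)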



For a diagonal matrix $X$ independent of $B$, and by using Proposition 5.4, we have

$E[\mu(XB) \mid x] = \mu(I_m) \cdot \kappa(x)$. (5.18)

Indeed, if in Proposition 5.4 the umbrae $\{\gamma_1, \ldots, \gamma_n\}$ are replaced by the elements of the
diagonal matrix $X$, and the indeterminates $\{z_1, \ldots, z_n\}$ by the diagonal entries of the
matrix $B$, equation (5.13) may be updated as



(5.19) 



AH 



Due to (5.17), we have $E[\sigma_\lambda] = m^{l(\lambda)}$. So again equation (5.18) follows by using the action
of $\mathfrak{S}_i$ on the group algebra $\mathbb{R}(\mathfrak{S}_i)$. The result follows from Theorem 5.2 by observing that

$E[\kappa(y) \mid x] = E[\mu(I_m)^{-1} \cdot \mu(XB) \mid x] = E[\mu(I_m)^{-1} \cdot \mu(I_m) \cdot \kappa(x) \mid x] = \kappa(x)$.

□



Remark 5.6. The computation of $\mu(I_m)^{-1}$ requires the solution of a system of $m$ equations
in $m$ indeterminates, $\mu(I_m) \cdot \mu(I_m)^{-1} = \mu(I_m)^{-1} \cdot \mu(I_m) = \delta$, with coefficients given by
$\mu(I_m)$. This task may be performed with standard procedures in any symbolic package. A
different way consists in resorting to the so-called Weingarten function on $\mathfrak{S}_i$. See [2] for
the definition and the properties of the Weingarten function, which involves the characters
of $\mathfrak{S}_i$ and Schur symmetric polynomials indexed by $\lambda \vdash i$.
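The linear-system route mentioned in Remark 5.6 can be sketched in exact rational arithmetic. The code below is an illustration under stated assumptions, not the authors' implementation: permutations of $\{0, \ldots, i-1\}$ are stored as tuples, $\mu(\cdot)(\sigma)$ is taken to be the product of traces over the cycles of $\sigma$ as in (5.5), and the unknown function $\kappa(y)$ is obtained by solving $\mu(I_m) \cdot \kappa(y) = \mu(Y)$ instead of inverting $\mu(I_m)$ explicitly. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

def cycle_lengths(p):
    """Cycle lengths of a permutation p, where p[k] is the image of k."""
    seen, lengths = set(), []
    for start in range(len(p)):
        k, length = start, 0
        while k not in seen:
            seen.add(k)
            k = p[k]
            length += 1
        if length:
            lengths.append(length)
    return lengths

def mu(trace_of_power, p):
    """mu(.)(p): product over the cycles c of p of Tr(.^{l(c)})."""
    out = Fraction(1)
    for length in cycle_lengths(p):
        out *= trace_of_power(length)
    return out

def solve(a, b):
    """Exact Gauss-Jordan elimination over the rationals."""
    n = len(b)
    aug = [row[:] + [b[r]] for r, row in enumerate(a)]
    for c in range(n):
        piv = next(r for r in range(c, n) if aug[r][c] != 0)
        aug[c], aug[piv] = aug[piv], aug[c]
        pivot = aug[c][c]
        aug[c] = [v / pivot for v in aug[c]]
        for r in range(n):
            if r != c and aug[r][c] != 0:
                f = aug[r][c]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[c])]
    return [aug[r][n] for r in range(n)]

def kappa(y, i):
    """Solve mu(I_m) . kappa = mu(Y) on the symmetric group of degree i (needs i <= m)."""
    m = len(y)
    group = list(permutations(range(i)))
    inverse = lambda p: tuple(sorted(range(i), key=lambda k: p[k]))
    compose = lambda p, q: tuple(p[q[k]] for k in range(i))
    power_sum = lambda r: sum(Fraction(v) ** r for v in y)  # Tr(Y^r)
    trace_id = lambda r: Fraction(m)                         # Tr(I_m^r) = m
    # Row sigma, column tau: coefficient mu(I_m)(sigma tau^{-1}) of kappa(tau).
    a = [[mu(trace_id, compose(s, inverse(t))) for t in group] for s in group]
    b = [mu(power_sum, s) for s in group]
    return dict(zip(group, solve(a, b)))

k2 = kappa([1, 2, 3, 4], 2)
print(k2[(1, 0)], k2[(0, 1)])  # values at the transposition and at the identity
```

For $i = 2$ the value at the transposition reproduces $(mS_2 - S_1^2)/(m(m^2-1))$ and the value at the identity reproduces $(mS_1^2 - S_2)/(m(m^2-1))$. The system is singular when $i > m$, in line with the condition $i \le m$.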



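The spectral k-statistics can be expressed in terms of power sums $S_r = \sum_{j=1}^{n} x_j^r$ as
follows:

$\mathfrak{K}_{(1)} = \dfrac{S_1}{n} = k_{(1)}$

$\mathfrak{K}_{(2)} = \dfrac{n S_2 - S_1^2}{n (n^2 - 1)} = \dfrac{k_{(2)}}{n + 1}$

$\mathfrak{K}_{(1^2)} = \dfrac{n S_1^2 - S_2}{n (n^2 - 1)} = \dfrac{k_{(1^2)} + n\,(k_{(1)})^2}{n + 1}$

$\mathfrak{K}_{(3)} = \dfrac{2\,(2 S_1^3 - 3 n S_1 S_2 + n^2 S_3)}{n (n^2 - 1)(n^2 - 4)} = \dfrac{2 k_{(3)}}{(n + 1)(n + 2)}$

$\mathfrak{K}_{(1,2)} = \dfrac{-2 n S_3 + (n^2 + 2) S_1 S_2 - n S_1^3}{n (n^2 - 1)(n^2 - 4)} = \dfrac{2 k_{(1,2)} + n\, k_{(1)} k_{(2)}}{(n + 1)(n + 2)}$

$\mathfrak{K}_{(1^3)} = \dfrac{S_1^3 (n^2 - 2) - 3 n S_1 S_2 + 4 S_3}{n (n^2 - 1)(n^2 - 4)} = \dfrac{2 k_{(1^3)} - 3 k_{(1)} k_{(2)} + n (n + 3)(k_{(1)})^3}{(n + 1)(n + 2)}$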
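As a quick sanity check of the single-index expressions of degree up to 3, the sketch below compares them with the classical k-statistics in exact arithmetic. It assumes the power-sum forms $\mathfrak{K}_{(2)} = (nS_2 - S_1^2)/(n(n^2-1))$ and $\mathfrak{K}_{(3)} = 2(2S_1^3 - 3nS_1S_2 + n^2S_3)/(n(n^2-1)(n^2-4))$, together with the standard formulas for Fisher's $k_{(2)}$ and $k_{(3)}$; the function names and the data are illustrative.

```python
from fractions import Fraction

def power_sums(x, rmax):
    """Return [S_0, S_1, ..., S_rmax] for the data x, as exact fractions."""
    return [sum(Fraction(v) ** r for v in x) for r in range(rmax + 1)]

def classical_k(x):
    """Fisher's k-statistics k_2 and k_3 in terms of power sums (needs n >= 3)."""
    n = len(x)
    _, s1, s2, s3 = power_sums(x, 3)
    k2 = (n * s2 - s1 ** 2) / (n * (n - 1))
    k3 = (n ** 2 * s3 - 3 * n * s1 * s2 + 2 * s1 ** 3) / (n * (n - 1) * (n - 2))
    return k2, k3

def spectral_K(x):
    """Single-index spectral statistics of degree 2 and 3 (needs n >= 3)."""
    n = len(x)
    _, s1, s2, s3 = power_sums(x, 3)
    d2 = n * (n ** 2 - 1)
    d3 = d2 * (n ** 2 - 4)
    K2 = (n * s2 - s1 ** 2) / d2
    K3 = 2 * (2 * s1 ** 3 - 3 * n * s1 * s2 + n ** 2 * s3) / d3
    return K2, K3

x = [1, 2, 4, 8, 9]                      # assumed data, n = 5
n = len(x)
k2, k3 = classical_k(x)
K2, K3 = spectral_K(x)
print(K2 == k2 / (n + 1), K3 == 2 * k3 / ((n + 1) * (n + 2)))

# Translation invariance for r >= 2: shifting every x_j by a constant changes nothing.
shifted = [v - 7 for v in x]
print(spectral_K(shifted) == (K2, K3))
```

Both comparisons are exact because the computation uses rational numbers throughout.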

The functions of degree 4 are a little more complicated: 

_ S 4 {n 3 + n) - ASxS^n 2 + 1) + g|(3 - 2 n 2 ) + 10 n S 2 S 2 - 5Sf 
(4) ~ n 2 (n 2 - 1) (n 2 - 4) (n 2 - 9) 

-3nS A {n 2 + 1) + gigaOj + 3n 2 + n 4 ) + S 2 (6n 2 - 9) - 3 n S 2 S 2 {n 2 + 1) 
(1 ' 3) ~ n 2 {n 2 - 1) (n 2 - 4) (n 2 - 9) 

2(2n 2 - 3)S 4 



-£(2 2 ) = 
+ 

% 2 ,2) = 
%4) = 

+ 



n 2 (n 2 - 1) (n 2 - 4) (n 2 - 9) 
2 £ 4 (3 n - 2 n 3 ) + 4 5i5 3 (4n 2 - 6) + (18 + n 4 - 6n 2 ) - 2 n S 2 S 2 {n 2 + 6) 

n 2 (n 2 - 1) (n 2 - 4) (n 2 - 9) 

(n 2 + 6)5f 



n 2 (n 2 - 1) (n 2 - 4) (n 2 - 9) 

10ng 4 - ^SiSsjn 2 + 1) + (n 2 + 6) + nSfS 2 {n 2 + 1) + (4 - n 2 )Sj 

n(n 2 -l)(n 2 -4) {n 2 - 9) 
-30nS 4 + 4SiS 3 (4n 2 - 6) + S 2 (3n 2 + 18) + 6nS?S 2 (4 - n 2 ) 
n 2 (n 2 - 1) (n 2 - 4) (n 2 - 9) 
(6-8n 2 + n 4 )S 4 



n 2 (n 2 - 1) (n 2 - 4) (n 2 - 9) ' 



For comparison purposes, all of the single-index functions $\mathfrak{K}_{(r)}(x)$ and the k-statistics $k_{(r)}(x)$
for $r \ge 2$ are invariant under translation: $\mathfrak{K}_{(r)}(x - \bar{x}) = \mathfrak{K}_{(r)}(x)$. If the mean is zero, the
fourth-order statistics are

$(n)_4\, k_{(4)} = n^2 (n + 1) S_4 - 3 n (n - 1) S_2^2$
$\mathfrak{K}_{(4)} \propto n (n^2 + 1) S_4 - (2 n^2 - 3) S_2^2$,

showing that $\mathfrak{K}_{(4)}$ is not a simple multiple of $k_{(4)}$.
For a spectral sample, the first few conditional variances and covariances are



var(%)(?/ 1 x) 
cov(%) (y),& {2 )(y) I x) 
var(%)(2/ 1 x) 



which are similar to the covariances of the corresponding ^-statistics. 

We now characterize the limiting behavior of spectral polykays. To this end, we recall
the notion of free cumulant occurring within noncommutative probability theory [13]. A
noncommutative probability space is a pair $(A, \Phi)$, where $A$ is a unital noncommutative
algebra and $\Phi : A \to \mathbb{C}$ is a unital linear functional. This gives rise to a sequence of
multilinear functionals on $A$ via $\Phi_i(a_1, \ldots, a_i) = \Phi(a_1 \cdots a_i)$.

Let $NC_i$ denote the lattice of all noncrossing partitions of $[i]$. A noncrossing partition
$\pi = \{B_1, B_2, \ldots, B_k\}$ of the set $[i]$ is a partition such that if $1 \le h < l < s < k \le i$, with
$h, s \in B_n$ and $l, k \in B_{n'}$, then $n = n'$. For any noncrossing partition $\pi$ and $a_1, \ldots, a_i \in A$
we set

$\Phi_\pi(a_1, \ldots, a_i) = \prod_{B \in \pi} \Phi(a_{j_1} \cdots a_{j_s})$

for $B = (j_1 < \cdots < j_s)$. Free cumulants are defined as multilinear functionals such that

$c_\pi(a_1, \ldots, a_i) = \prod_{B \in \pi} c_s(a_{j_1}, \ldots, a_{j_s})$

and

$c_i(a_1, \ldots, a_i) = \sum_{\pi \in NC_i} m(\pi, 1_i)\, \Phi_\pi(a_1, \ldots, a_i)$

where $m(\pi, 1_i)$ is the Moebius function on the lattice of noncrossing partitions [13]. The
i-th cumulant of $a$ is $c_i = c_i(a, \ldots, a)$.
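For a single element $a$, the free cumulants can be computed from the moments $m_n = \Phi(a^n)$ without enumerating noncrossing partitions. The sketch below does not use the Moebius-function formula above directly; it relies on the equivalent standard recursion $m_n = \sum_{s=1}^{n} c_s \sum_{i_1 + \cdots + i_s = n - s} m_{i_1} \cdots m_{i_s}$ (with $m_0 = 1$), which is an assumption imported from the free probability literature rather than from this text. The function names are hypothetical.

```python
from fractions import Fraction

def series_power_coeff(m, s, j):
    """Coefficient of z^j in (sum_k m[k] z^k)^s, where m[0] = 1."""
    poly = [Fraction(1)] + [Fraction(0)] * j
    for _ in range(s):
        # multiply the truncated series by the moment series, keeping degrees <= j
        poly = [sum(poly[a] * m[b - a] for a in range(b + 1)) for b in range(j + 1)]
    return poly[j]

def free_cumulants(moments):
    """Free cumulants c_1..c_N from moments m_1..m_N via the recursion above."""
    m = [Fraction(1)] + [Fraction(v) for v in moments]
    c = [Fraction(0)] * len(m)
    for n in range(1, len(m)):
        lower = sum(c[s] * series_power_coeff(m, s, n - s) for s in range(1, n))
        c[n] = m[n] - lower   # the s = n term contributes exactly c_n
    return c[1:]

# Semicircle law of variance 1: moments 0, 1, 0, 2, 0, 5 (Catalan numbers at even orders).
print(free_cumulants([0, 1, 0, 2, 0, 5]))
```

For the semicircle moments only the second free cumulant is nonzero, which is the free analogue of the Gaussian case for classical cumulants.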

By using Proposition 6.1 in [2], when $m$ goes to infinity, the mean of the normalized
spectral k-statistics corresponding to $\lambda = (1^{r_1}, 2^{r_2}, \ldots) \vdash i$,

$\kappa_\lambda^{(N)}(y) := m^{i - l(\lambda)}\, \kappa_\lambda(y)$,

tends towards the product of free cumulants $c_1^{r_1} c_2^{r_2} \cdots$, with $\kappa_\lambda(y) := \kappa(y)(\sigma)$ given in
(5.6), and $\sigma$ a permutation of class $\lambda$.

6 Generalized spectral polykays 

The notion of generalized cumulant has been discussed by McCullagh [11] and involves
set partitions. In umbral terms, if $\pi$ is a partition of $\{\mu_1, \mu_2, \ldots, \mu_i\}$ then the generalized



&{2){X)( ) 



2%)^) 

\m n 



1 1 

n 

\m z — 1 — 1 
, . (n — m)(2m 2 n 2 — 3n 2 — 3m 2 — ran + 3) 
nra[ra z — lHrr — 1) 
cumulant $\kappa_\pi$ is defined as [3]

$E[(\chi.\mu)_\pi] = \kappa_\pi$ with $(\chi.\mu)_\pi = \prod_{B \in \pi} (\chi.\mu_B)$ and $\mu_B = \prod_{j \in B} \mu_j$.

For example if $i = 5$ and $\pi = \{\{\mu_1, \mu_2\}, \{\mu_3\}, \{\mu_4, \mu_5\}\}$ then

$E[(\chi.\mu)_\pi] = E[(\chi.\mu_1\mu_2)(\chi.\mu_3)(\chi.\mu_4\mu_5)] = \kappa^{12,3,45}$

using McCullagh's notation. Generalized k-statistics are the sample version of the
generalized cumulants. The importance of generalized k-statistics stems from the following
properties: the generalized k-statistics are linearly independent; every polynomial symmetric
function can be expressed uniquely as a linear combination of generalized k-statistics; any
polynomial symmetric function whose expectation is independent of $n$ can be expressed as a
linear combination of generalized k-statistics with coefficients independent of $n$ [12]. Due
to the last property, natural statistics can be expressed as linear combinations of
generalized k-statistics with coefficients independent of $n$.

Theorem 6.1. If $\lambda \vdash i \le m$ and $\pi$ is a set partition of class $\lambda$, then generalized k-statistics
of spectral polykays are

/^-^(-ljH-HM-l)!^) (6.1) 

r>7r 

where $\kappa_\tau(y)$ denotes the function on $\Pi_i$ such that $\kappa(y)(\tau) := \kappa(y)(\sigma)$, with $\sigma \in \mathfrak{S}_i$ a
permutation of the same class as $\tau$ and $\kappa(y)$ given in (5.6).

The proof of Theorem 6.1 relies on Proposition 5.4 of [3]. We do not invert equivalence
(6.4) because the linear combination giving spectral polykays in terms of their generalized
k-statistics is quite cumbersome, see equation (3.18) in [12]. Instead, there are alternative
systems of symmetric functions that are more suitable from a computational point of view.
All such systems are invertible linear functions of generalized k-statistics with coefficients
independent of the sample size, and the properties given above are preserved under such
transformations.

To characterize such coefficients, we first recall the Moebius inversion formula on the
lattice of set partitions [15]. The set $\Pi_i$ with the refinement order $\le$ is a lattice, where
$\pi \le \tau$ if for any block $B \in \pi$ there exists a block $B' \in \tau$ such that $B \subseteq B'$. If $G$ is a
function on $\Pi_i$ and

$F(\pi) = \sum_{\tau \ge \pi} G(\tau)$

then the Moebius inversion formula states that

$G(\pi) = \sum_{\tau \ge \pi} m(\pi, \tau)\, F(\tau)$ (6.2)

where $m(\pi, \tau)$ is the so-called Moebius function. It can be shown that

$m(\pi, \tau) = (-1)^{s - t} (2!)^{r_3} (3!)^{r_4} \cdots$

where $r_1 + 2 r_2 + \cdots = s = |\pi|$, $r_1 + r_2 + \cdots = t = |\tau|$ and $(1^{r_1}, 2^{r_2}, \ldots)$ is the partition,
usually denoted by $\lambda(\pi, \tau)$, of the integer $s$ such that $r_j$ blocks of $\tau$ contain exactly $j$ blocks
of $\pi$.

Definition 6.2. If $\lambda \vdash i \le m$ and $\pi$ is a set partition of class $\lambda$, the (transformed)
generalized k-statistics of spectral polykays are $l_\tau(y)$ such that

$\kappa_\pi(y) = \sum_{\tau \ge \pi} l_\tau(y)$, (6.3)

where $\kappa_\tau(y)$ denotes the function on $\Pi_i$ such that $\kappa(y)(\tau) := \kappa(y)(\sigma)$, with $\sigma \in \mathfrak{S}_i$ a
permutation of the same class as $\tau$ and $\kappa(y)$ given in (5.6).

The linear combination in (6.3) is very simple, involving coefficients all equal to 1. By
using the Moebius inversion formula (6.2), from (6.3) we have

$l_\pi(y) = \sum_{\tau \ge \pi} m(\pi, \tau)\, \kappa_\tau(y)$ (6.4)

Since $\kappa(y)(\sigma)$ depends only on the cycle structure $C(\sigma)$, $\kappa_\tau(y)$ depends only on
the block sizes in $\tau$. So in the sum (6.4), there are $d_\lambda$ spectral k-statistics equal to $\kappa_\tau(y)$,
all those having the same class $\lambda$. Therefore spectral polykays of degree $i$ can be indexed
by partitions of $i$. As an example, by Definition 6.1 the spectral polykays up to order 4 are

i = 1:  $l_{(1)} = \kappa_{(1)}$

i = 2:  $l_{(1^2)} = \kappa_{(1^2)} - \kappa_{(2)}$

i = 3:  $l_{(1,2)} = \kappa_{(1,2)} - \kappa_{(3)}$

        $l_{(1^3)} = \kappa_{(1^3)} - 3\kappa_{(1,2)} + 2\kappa_{(3)}$

i = 4:  $l_{(1,3)} = \kappa_{(1,3)} - \kappa_{(4)}$

        $l_{(2^2)} = \kappa_{(2^2)} - \kappa_{(4)}$

        $l_{(1^2,2)} = \kappa_{(1^2,2)} - 2\kappa_{(1,3)} - \kappa_{(2^2)} + 2\kappa_{(4)}$

        $l_{(1^4)} = \kappa_{(1^4)} - 6\kappa_{(1^2,2)} + 8\kappa_{(1,3)} + 3\kappa_{(2^2)} - 6\kappa_{(4)}$
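The integer coefficients in these expansions can be reproduced mechanically from the Moebius function on the lattice of set partitions. The sketch below is an illustration only: it assumes the formula $m(\pi, \tau) = (-1)^{s-t} (2!)^{r_3} (3!)^{r_4} \cdots$ recalled above, enumerates every $\tau \ge \pi$ by merging blocks of $\pi$, and accumulates the Moebius values by the class (sorted block sizes) of $\tau$. The names `set_partitions` and `polykay_coefficients` are hypothetical.

```python
from collections import Counter
from math import factorial

def set_partitions(items):
    """Yield all set partitions of a list, each as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        yield [[first]] + part
        for k in range(len(part)):
            yield part[:k] + [[first] + part[k]] + part[k + 1:]

def polykay_coefficients(pi):
    """Moebius coefficients m(pi, tau) summed over tau >= pi, grouped by class of tau."""
    coeffs = Counter()
    for grouping in set_partitions(list(range(len(pi)))):
        # each block g of `grouping` merges the blocks pi[b], b in g, into one block of tau
        tau = [sum((pi[b] for b in g), []) for g in grouping]
        sign = (-1) ** (len(pi) - len(tau))          # (-1)^(s - t)
        weight = 1
        for g in grouping:
            weight *= factorial(len(g) - 1)          # (j - 1)! per block holding j blocks of pi
        cls = tuple(sorted(len(block) for block in tau))
        coeffs[cls] += sign * weight
    return dict(coeffs)

print(polykay_coefficients([[1], [2], [3], [4]]))    # coefficients for class (1^4)
print(polykay_coefficients([[1], [2], [3, 4]]))      # coefficients for class (1^2, 2)
```

Keys are block-size tuples, so `(1, 1, 2)` stands for the class $(1^2, 2)$ and `(4,)` for the class $(4)$.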

In addition, we have $l_{(i)} = \kappa_{(i)}$. We take a moment to motivate this definition. Tukey
[21] gives very similar equations connecting classical polykays and k-statistics. We just
recall those up to order 4.

i = 1:  $k_{(1)} = k_{(1)}$

i = 2:  $k_{(1^2)} = k_{(1)} k_{(1)} - \dfrac{1}{m} k_{(2)}$

i = 3:  $k_{(1,2)} = k_{(1)} k_{(2)} - \dfrac{1}{m} k_{(3)}$

        $k_{(1^3)} = k_{(1)} k_{(1)} k_{(1)} - \dfrac{3}{m} k_{(2)} k_{(1)} + \dfrac{2}{m^2} k_{(3)}$

i = 4:  $k_{(1,3)} = k_{(1)} k_{(3)} - \dfrac{1}{m} k_{(4)}$

        $k_{(2^2)} = \dfrac{m - 1}{m + 1} k_{(2)} k_{(2)} - \dfrac{m - 1}{m (m + 1)} k_{(4)}$

        $k_{(1^2,2)} = k_{(2)} k_{(1)} k_{(1)} - \dfrac{2}{m} k_{(3)} k_{(1)} - \dfrac{m - 1}{m (m + 1)} k_{(2)} k_{(2)} + \dfrac{2}{m (m + 1)} k_{(4)}$

        $k_{(1^4)} = k_{(1)} k_{(1)} k_{(1)} k_{(1)} - \dfrac{6}{m} k_{(2)} k_{(1)} k_{(1)} + \dfrac{8}{m^2} k_{(3)} k_{(1)} + \dfrac{3 (m - 1)}{m^2 (m + 1)} k_{(2)} k_{(2)} - \dfrac{6}{m^2 (m + 1)} k_{(4)}$

The two sets of equations are very similar in structure.

The refinement order in (6.3) is inverted with respect to that connecting moments and
cumulants [H]. It is the same as that employed in the change of basis between augmented
symmetric functions and power sums [3] and employed by Tukey in order to show the
multiplicative structure of $k_\lambda(x)$ for infinite populations.

In terms of power sums in the eigenvalues, the transformed generalized spectral polykays
up to order 4 are:

$l_{(1,2)} = \dfrac{(m + 1) S_1 S_2 - S_1^3 - m S_3}{m (m - 1)(m + 1)(m - 2)}$

$l_{(1^2,2)} = \dfrac{2 m S_4 + (m + 3) S_1^2 S_2 - (2m + 2) S_1 S_3 - m S_2^2 - S_1^4}{m (m - 1)(m + 1)(m - 2)(m - 3)}$

$l_{(2^2)} = \dfrac{S_1^4 + (m^2 - 3m + 3) S_2^2 + (4m - 4) S_1 S_3 - 2 m S_1^2 S_2 + (-m^2 + m) S_4}{m^2 (m^2 - 1)(m - 2)(m - 3)}$

$l_{(1,3)} = \dfrac{1}{m^2 (m^2 - 1)(m^2 - 4)(m - 3)} \{ -S_4\, m (m^2 + 1) + S_1 S_3 (m^3 + m^2 + 4) + S_2^2 (2 m^2 - 3) - m S_1^2 S_2 (3m + 1) + S_1^4 (2m - 1) \}$

The spectral statistics $l_{(1^r)}$ are the same as the corresponding polykays $k_{(1^r)}$.

Theorem 6.3. When $m$ goes to infinity the mean of the normalized (transformed)
generalized k-statistics

$l_\pi^{(N)}(y) := m^{i - |\pi|}\, l_\pi(y)$,

tends to $d_\lambda\, c_1^{r_1} c_2^{r_2} \cdots$, with $\lambda = (1^{r_1}, 2^{r_2}, \ldots) \vdash i$ the class partition of $\pi$, and $\{c_j\}$ free
cumulants.

Proof. After multiplying both sides of (6.4) by $m^{i - |\pi|}$ we have

$m^{i - |\pi|}\, l_\pi(y) = \sum_{\tau \ge \pi} m(\pi, \tau)\, m^{i - |\pi|}\, \kappa_\tau(y)$.

Since $\tau \ge \pi$ then $|\tau| \le |\pi|$ so that

$m^{i - |\pi|}\, l_\pi(y) = \sum_{\tau \ge \pi} m(\pi, \tau)\, m^{|\tau| - |\pi|}\, m^{i - |\tau|}\, \kappa_\tau(y)$.

As $m$ goes to infinity, for all $\tau \ge \pi$ having the same class partition, $m^{i - |\tau|} \kappa_\tau(y)$ tends
towards the free cumulant $c_\tau$, whereas $m^{|\tau| - |\pi|}$ goes to zero for $\tau > \pi$. The result follows since for
$\tau = \pi$ we have $m(\pi, \tau) = 1$ and for all $\pi$ having the same class partition $m^{i - |\pi|} \kappa_\pi(y)$ goes
to $c_\pi = c_1^{r_1} c_2^{r_2} \cdots$. □